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© Seed specific transcriptional regulation. 

© Nucleic acid sequences and methods for their use are 
provided which provide for seed specific transcription, in order 
to modulate or modify expression in seed, particularly embryo 
cells. Transcriptional initiation regions are identified and 
isolated from plant cells and used to prepare expression 
cassettes which may then be transformed into plant cells for 
seed specific transcription. The method finds particular use in 
conjunction with modifying fatty acid production in seed tissue 
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SEED SPECIFIC TRANSCRIPTIONAL REGULATION 



Description 

INTRODUCTION 

5 

^Genetic mod ification of plant material is provided for seed specific transcription. Production of endogenous 
products may be modulated or new capabilities provided. 

10 Ba Thep°rima ry emphasis in genetic modification has been directed to prokaryotes and mammalian cells. For a 
variety of reasons plants have proven more intransigent than other eukaryotic cells in the ability to genetically 
manipulate the plants. In part, this has been the result of the different goals involved, since for the most part. . 
plant modification has been directed to modifying the entire plant or a particular plant part in a live plant, as 

15 distinct from modifying cells in culture. 

For many applications, it will be desirable to provide for transcription in a particular plant part or at a 
particular time in the growth cycle of the plant. Toward this end. there is a substantial interest in identifying 
endogenous plant products whose transcription or expression are regulated in a manner of interest. In 
identifying such products, one must first look for products which appear at a particular time in the eel growth 

20 cycle or in a particular plant part, demonstrate its absence at other times or in other parts identify nucleic acid 
sequences associated with the product and then identify the sequence in the genome of the plant in order to 
obtain the 5'-untranslated sequence associated with transcription. This requires substant.al investigation m 
first identifying the particular sequence, followed by establishing that it is the correct sequence and isolating 
the desired transcriptional regulatory region. One must then prepare appropriate constructs, followed by 

25 demonstration that the constructs are efficacious in the desired manner 

Identifying such sequences is a challenging project, subject to substantial pitfalls and uncertainty. There is 
however substantial interest in being able to genetically modify plants, which justifies the substant.al 
expenditures and efforts in identifying transcriptional sequences and manipulating them to determine their 
utility. 

30 

**CTol*T*An* M o\ecute Form and Function of the Plant Genome , eds van Vloten-Doting. Groot and Hall 
Plenum Publilhing Corp. 1985, pp 555-566; Crouch and Sussex. Planta (1981) 153:64-74 ; Crouch at al.. XMol. 
Apjl Genet (1983) 2:273-283; and Simon et al., Plant Molecular Biology (1985) 5: 191-201. describe various 
as'pecrs ofBr Lica napus storage proteinsTBeachy et al., EMBO J. (1985) 4:3047-3053. Sengupta-Gopalan et 
al Proc N atl Acad . Sci. U SA (1985) 82:3320-3324; Greenwood and Chnspeels, Plant Physiol. (1985) 79.65-71 
7n d Chen et al. Proc. N atl. Ac ad. Sci.USA (1986) 83:8560-8564 describe studies concerned with seed storage 
proteinrarTd-g enetic manipulation, hckes et al.. Mo.. Gen. Genet. (1986) 205:14-22 and Fluhr et al.. Scjence 
(1986) 232:1106-1112 describe the genetic manipulation of light inducible plant genes. 

SUMMAR Y OF THE INVENTION ■■ «,„h =n^ifir 

DNA constructs are provided w hich are employed in manipulating plant cells to prov.de for seed-specific 
transcription. Particularly, storage protein transcriptional regions are joined to other than the wild-type gene 
and introduced into plant genomes to provide for seed-specific transcription. The constructs provide for 
45 modulation of endogenous products as well as production of heterologous products. 

DESCRIPTION OF THE S PECIFIC EMBODIMENTS . 

Novel DNA constructs are p rovided which allow for modification of transcription in seed, particularly in 
embryos during seed maturation. The DNA constructs comprise a regulated transcriptional .nitiat.on region 

50 associated with seed formation, preferably in association with embryogenesis and seed maturation. Of 
particular interest are those transcriptional initiation regions associated with storage proteins, such as nap.n, 
cruciferin S-conglycinin, phaseolin, or the like. The transcriptional initiation regions may be obtained from any 
convenient host, particularly plant hosts such as Brassica . e.g. napus or campestris. soybean ( Glycine max ), 
bean ( Phaseolus vulgaris ), corn (Zea mays) . cotton (Gossypium sp.). safflower (Carthamus tinctonus), tomato 

55 (Lvcopersican esculentum), and Cuphea species. 

Downstream from and u nder thilrlRscriptional initiation regulation of the seed specific region will be a 
sequence of interest which will provide for modification of the phenotype of the seed, by modulating the 
production of an endogenous product, as to amount, relative distribution, or the ! lk £ J 3 ' 
heterologous expression product to provide for a novel function or product in the seed. The DNA construct w.M 

60 also provide for a termination region, so as to provide an expression cassette into which a 8™ may be 
introduced. Conveniently, transcriptional initiation and termination regions may be provided separated ,n he 
direction of transcription by a linker or polylinker having one or a plurality of restnct.on s.tes for insertion of the 
gene to be under the transcriptional regulation of the regulatory regions. Usually, the hnker w.ll have from 1 to 
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10, more usually from about 1 to 8, preferably from about 2 to 6 restriction sites. Generally, the linker will be 
fewer than 100 bp, frequently fewer than 60 bp and generally at least about 5 bp. 

The transcriptional initiation region may be native or homologous to the host or foreign or heterologous to 
the host. By foreign is intended that the transcriptional initiation region is not found in the wild-type host into 
which the transcriptional initiation region is introduced. 

Transcriptional initiation regions of particular interest are those associated with the Brassica napus or 
campestris napin genes, acyl carrier proteins, genes that express from about day 7 to day 40 in seed, 
particularly having maximum expression from about day 10 to about day 20, where the expressed gene is not 
found in leaves, while the expressed product is found in seed in high abundance. 

The transcriptional cassette will include in the 5'-3' direction of transcription, a transcriptional and 
translations initiation region, a sequence of interest, and a transcriptional and translational termination region 
functional in plants. One or more introns may also be present. The DNA sequence may have any open reading 
frame encoding a peptide of interest, e.g. an enzyme, or a sequence complementary to a genomic sequence, 
where the genomic sequence may be an open reading frame, an intron, a non-coding leader sequence, or any 
other sequence where the complementary sequence will inhibit transcription, messenger RNA processing, 
e.g. splicing, or translation. The DNA sequence of interest may be synthetic, naturally derived, or combinations 
thereof. Depending upon the nature of the DNA sequence of interest, it may be desirable to synthesize the 
sequence with plant preferred codons. The plant preferred codons may be determined from the codons of 
highest frequency in the proteins expressed in the largest amount in the particular plant species of interest. 

In preparing the transcription cassette, the various DNA fragments may be manipulated, so as to provide for 
the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this 
end, adapters or linkers may be employed for joining the DNA fragments or other manipulations may be 
involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or 
the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, resection, ligation, or the 
like may be employed, where insertions, deletions or substitutions, e.g. transitions and transversions, may be 
involved. 

The termination region which is employed will be primarily one of convenience, since the termination regions 
appear to be relatively interchangeable. The termination region may be native with the transcriptional initiation 
region, may be native with the DNA sequence of interest, or may be derived from another source. Convenient 
termination regions are available from the Ti-plasmid of A. tumefaciens , such as the octopine synthase and 
nopaline synthase termination regions. 

By appropriate manipulations, such as restriction, chewing back or filling in overhangs to provide blunt 
ends, ligation of linkers, or the like, complementary ends of the fragments can be provided for joining and 
ligation. 

In carrying out the various steps, cloning is employed, so as to amplify the amount of DNA and to allow for 
analyzing the DNA to ensure that the operations have occurred in a proper manner. A wide variety of cloning 
vectors are available, where the cloning vector includes a replication system functional in E. coli and a marker 
which allows for selection of the transformed cells. Illustrative vectors include pBR332, pUC"series, M13mp 
series, pACYC184, etc. Thus, the sequence may be inserted into the vector at an appropriate restriction 
site(s), the resulting plasmid used to transform the E. coli host, the E. coli grown in an appropriate nutrient 
medium and the ceils harvested and lysed and the plasmid recovered. Analysis may involve sequence analysis, 
restriction analysis, electrophoresis, or the like. After each manipulation the DNA sequence to be used in the 
final construct may be restricted and joined to the next sequence, where each of the partial constructs may be 
cloned in the same or different plasmids. 

In addition to the transcription construct, depending upon the manner of introduction of the transcription 
construct into the plant, other DNA sequences may be required. For example, when using the Ti- or Ri-piasmid 
for transformation of plant cells, as described below, at least the right border and frequently both the right a 
left borders of the T-DNA of the Ti-and Ri-plasmids will be joined as flanking regions to the transcription 
construct. The use of T-DNA for transformation of plant ceils has received extensive study and is amply 
described in EPA Serial No. 120,516. Hoekema, In: The Binary Plant Vector System Offset-drukkerij Kanters 
B.V., Albiasserdam, 1985, Chapter V, Fraley, et aL. Crit. Rev. P lant Sci., 4:1-46, and An et al., EMBO J (1985) 
4:277-284. 1 

Alternatively, to enhance integration into the plant genome, terminal repeats of transposons may be used as 
borders in conjunction with a transposase. In this situation, expression of the transposase should be 
inducible, or the transposase inactivated, so that once the transcription construct is integrated into the 
genome, it should be relatively stably integrated and avoid hopping. 

The transcription construct will normally be joined to a marker for selection in plant cells. Conveniently, the 
marker may be resistance to a biocide, particularly an antibiotic, such as kanamycin, G418, bleomycin, 
hygromycin, chloramphenicol, or the like. The particular marker employed will be one which will allow for 
selection of transformed cells as compared to cells lacking the DNA which has been introduced. 

A variety of techniques are available for the introduction of CNA into a plant cell host. These techniques 
include transformation with Ti-DNA employing A. tumefaciens or A. rhizogenes as the transforming agent, 
protoplast fusion, injection, electroporation, etc. For transformation with Agrobacterium , plasmids can be 
prepared in E. coli which plasmids contain DNA homologous with the Ti-plasmid, particularly T-DNA. The 
plasmid may or may not be capable of replication in Agrobacterium , that is, it may or may not have a broad 



0 255 378 



15 



20 



25 



30 



transformed organism used for transforming plant cells ~ ^^Dl and the resulting 
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The following examples are offered by way of illustration and not by way of limitation. 



40 



45 



50 



55 



60 



4 



0 255 378 



EXPERIMENTAL 
Materials and Methods 

Cloning Vectors 5 

Cloning vectors used include the pUC vectors, pUC8 and pUC9 (Vieira and Messing, Gene (1982) 
19:259-268); pUC18 and pUC19 (Norrander et al., Gene (1983) 26:101-106; Yan isch- Perron etal., "Gene (1985) 
33:103-119), and analogous vectors exchanging chloramphenicol resistance (CAM) as a marker for the 
ampicillin resistance of the pUC plasmids described above (pUC-CAM [pUC12-Cm t pUC13-Cm] Buckley, D. 10 
Ph.D. Thesis, U.C.S.D., CA 1985). The multiple cloning sites of pUC18 and pUC19 vectors were exchanged with 
those of pUC-CAM to create pCGN565 and pCGN566 which are CAM resistant. Also used were pUC118 and 
pUC119, which are respectively. pUC18 and pUC19 with the intergenic region of M13, from an HgiAl site at 
5465 to the Ahalll site at 5941, inserted at the Ndel site of pUC. (Available from Vieira J. and~Messing, J. 
Waksman Institute, Rutgers University, Rutgers, N.J.) '15 

Materials 

Terminal deoxynucleotide transferase (TDT), RNaseH, E. coli DNA polymerase, T4 kinase, and restriction 
enzymes were obtained from Bethesda Research Laboratories; E. coli DNA ligase was obtained from New 
England Biolabs; reverse transcriptase was obtained from Life Sciences, Inc.; isotopes were obtained from 20 
Amersham; X-gal was obtained from Bachem, Inc. Torrance, CA. 



Example I 

Construction of a Napin Promoter 25 

There are 298 nuciotides upstream of the ATG start codon of the napin gene on the pgN1 clone, a 3.3 kb 
EcoRI fragment of B. napus genomic DNA containing a napin gene cloned into p(JC8 (available from Marti 
Crouch, University of Indiana). pgN1 DNA was digested with EcoRI and SstI and ligated to EcoRI/Sstl digested 
pCGN706. (pCGN706 is an Xhol/Pstl fragment containing 3' and polyadenylation sequences of aTother napin 
cDNA clone pN2 (Crouch et a}., 1983 supra ) cloned in pCGN566 at the Sail and Pstl sites.) The resulting clone 30 
PCGN707 was digested with Salt and treated with the enzyme Bal31 to remove some of the coding region of 
the napin gene. The resulting resected DNA was digested with Sma l after the Bal31 treatment and religated. 
One of the clones, pCGN713, selected by size, was subcloned by Eco RI and Bam HI digestion into both 
EcoRI/BamHI digested pEMBL18 (Dente et al., Nucleic Acids Res. (1983) ]V. 1645-1655) and pUC118 to give 
E418 and E41 18 respectively. The extent of Bal3l digestion was conformed by Sanger dideoxy sequencing of 35 
E418 template. The Bal3l deletion of the promoter region extended only to 57 nucleotides downstream of the 
start codon, thus containing the 5' end of the napin coding sequence and about 300 bp of the 5' non-coding 
region. E41 18 was tailored to delete all of the coding region of napin including the ATG start codon by in vitro 
mutagenesis by the method of Zoll er and Smith ( Nucleic Acids Res. (1982) 20:6487-6500) using~aiT 
oligonucleotide primer 5'-GATGTTTTGTATGTGGGCCCCTAGGAGATC-3'. Screening for the appropriate 40 
mutant was done by two transformations into E. coli strain JM83 (Messing J.. In: Recombinant DNA Technical 
Bulletin, NIH Publication No. 79-99, 2 No. 2, 1979, pp 43-48) and Sma l digestion of putative transformants. The 
resulting napin promoter clone is pCGN778 and contains 298 nucleotides from the EcoRI site of pgN1 to the A 
nucleotide just before the ATG start codon of napin. The promoter region was subcloned into a 
chloramphenicol resistant background by digestion with EcoRI and Bam HI and ligation to EcoRI/BamHI 45 
digested pCGN565 to give pCGN779c. 

Extension of the Napin Promoter Clone 

pCGN779c contains only 298 nucleotides of potential 5'-regulatory sequence. The napin promoter was 
extended with a 1 .8 kb fragment found upstream of the 5'-EcoRI site on the original XBnNa clone. The - 3.5 kb 50 
Xhol fragment of XBnNa (available from M. Crouch), which includes the napin region, was subcloned into 
Sall-digested pUC119 to give pCGN930. A Hind III site close to a 5' Xhol site was used to subclone the 
Hindlll/EcoRI fragment of pCGN930 into Hind l II/ EcoRI digested Bluescript + (Vector Cloning Systems, San 
Diego, CA) to give pCGN942. An extended napin promoter was made by ligating pCGN779c digested with 
EcoRI and Pstl and pCGN942 digested with EcoRI and Pstl to make pCGN943. This promoter contains -2.1 55 
kb of sequence upstream of the original ATG of the napin gene contained on XBnNa. A partial sequence of the 
promoter region is shown in Figure 1. 

Napin Cassettes 

The extended napin promoter and a napin 3'-regulatory region is combined to make a napin cassette for 60 
expressing genes seed-specifically. The napin 3-region used is from the plasmid pCGN1924 containing the 
Xhol/ EcoRI fragment from pgN1 ( Xho l site is located 18 nucleotides from the stop codon of the napin gene) 
subcloned into EcoRI/Sall digested pCGN565. Hindlll/Pstl digested pCGN943 and pCGN1924 are ligated to 
make the napin cassette pCGN744. with unique cloning sites Sma l, Sai l, and Pstl for inserting genes. 
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Construction of cDNA Library from Spinach Leaves 

Total RNA was extracted from young spinach leaves in 4M guanidine thiocyanate buffer as described by 
Facc.otti et al. ( Biotechnology (1985) 3:241-246). Total RNA was subjected to oligo(dT)-ce!lulose column 
chromatography two times to yield poly(A) + RNA as described by Maniatis et a!., (1982) Molecular Cloning- A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York. A cDNA library was cons tructed in pUC13-Cm 
aaccordtng to the method of Gubler and Hoffman. ( Gene (1983) 25:263-269) with slight modifications RNasin 
was omitted in the synthesis of first strand cDNA as it interfered with second strand synthesis if not completely 
removed, and dCTP was used to tail the vector DNA and dGTP to tail double-stranded cDNA instead of the 
reverse as described in the paper. The annealed cDNA was transformed to competent E coli JM83 (Messinq 
(1979) supra) cells according to Hanahan ( J. Mol. Biol. (1983) 166:557-580) and spread dnto LB agar plates 
(Miller (1972) Experiments in Molecular Genetics. Cold Spring Harbor Laboratory, Cold Spring Harbor New 
York),- containing 50 ug/ml chloramphenicol and 0.0050/o X-Gal. 

Identification of Spinach ACP-I cDNA 

A total of approximately 8000 cDNA clones were screened by performing Southern blots (Southern J Mol 
BioLJ1975) 98:503) and dot blot (described below) hybridizations with clone analysis DNA from 40 pools 
representing 200 cDNA clones each (see below). A 5' end labeled synthetic oligonucleotide (ACPP4) that is at 
. 4 least 660/0 homologous with a 16 amino acid region of spinach ACP-I (5'-GATGTCTTGAGCCTTGTCCTCATC- 
CACATTGATACCAAACTCCTCCTC-3') is the complement to a DNA sequence that could encode the 16 amino' 
20 - acid peptide glu-glu-glu-phe-gly-ile-asn-val-asp-glu-asp-lys-ala-gln-asp-ile, residues 49-64 of spinach ACP-I 
(Kuo and Ohlrogge, Arch. Biochem. Biophys. (1984) 234:290-296) and was used for an ACP probe 

Clone analysis DNA for Southern and dot blot hybridizations was prepared as follows. Transformants were 
transferred from agar plates to LB containing 50 ug/ml chloramphenicol in groups of ten clones per 10 ml 
media. Cultures were incubated overnight in a 37° C shaking incubator and then diluted with an equal volume of 
25* media and allowed to grow for 5 more hours. Pools of 200 cDNA clones each were obtained by mixing 
contents of 20 samples. DNA was extracted from these cells as described by Birnboim and Doly ( Nucleic 
Acids Res - (1979 > 7:1513-1523). DNA was purified to enable digestion with restriction enzymes by extractions 
with phenol and chloroform followed by ethanol precipitation. DNA was resuspended in sterile distilled water 
and 1 ug of each of the 40 pooled DNA samples was digested with Eco RI and Hindlll and electrophoresed 
through 0.7o/ 0 agarose gels. DNA was transferred to nitrocellulose filters following the biot hybridization 
technique of Southern. 

ACPP4 was 5' end-labeled using y- 32 P dATP and T4 kinase according to the manufacturer's specifications 
Nitrocellulose filters from Southern blot transfer of clone analysis DNA were hybridized (24 hours 42° C) and 
washed according to Berent et al. ( BioTechniques (1985) 3:208-220). Dot blots of the same set of DNA pools 
were prepared by applying 1 ug of each DNA pool to nylon membrane filters in 0.5 M NaOH. These blots were 
hybridized with the probe for 24 hours at 42°C in 500/o formamide/10/o SDS/1 M NaCL, and washed at room 
temperature in 2X SSC/0.10/ 0 SDS (1X SSC = 0.15M NaCI; 0.01 5M Na citrate; SDS-sodium dodecyisulfate) 
DNA from the pool which was hybridized by the ACPP4 oligoprobe was transformed to JM83 cells and plated 
as above to yield individual transformants. Dot blots of these individual cDNA clones were prepared by 
applying DNA to nitrocellulose filters which were hybridized with the ACPP4 oligonucleotide probe and 
analyzed using the same conditions as for the Southern blots of pooled DNA samples. 

Nucleotide Sequence Analysis 

The positive clone, pCGNISOL, was analyzed by digestion with restriction enzymes and the followinq partial 
45 map was obtained. 

pUC13-Czn |-35-| 248 I-63-I 152 I -200 I 
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55 E- EcoRI S-Sall X-Xbal Sm-Smal 

B-BamHI Ss -Ss t l *-former PstI site destroyed 

with tailing 

60 

polylinker with available restriction sites indicated 

The cDNA clone was subcloned into pUC118 and pUC119 using standard laboratory techniques of 
65 restriction, ligation, transformation, and analysis (Maniatis et al., (1982) supra ). Single-stranded DNA template 
was prepared and DNA sequence was determined using the Sanger dideoxy technique (Sanger et al (1977) 
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Proc. Nat. Acad. Sci. USA 74:5463-5467). Sequence analysis was performed using a software package from 
Intelli-Genetics, Inc. 

pCGNISOL contains an (approximately) 700 bp cDNA insert including a stretch of A residues at the 3' 
terminus which represents the pofy(A) tail of the mRNA. An ATG codon at position 61 is presumed to encode 
the MET translation initiation codon. This codon is the start of a 41 1 nucleotide open reading frame, of which, 5 
nucleotides 229-471 encode a protein whose amino acid sequence corresponds almost perfectly with the pub 
lished amino acid sequence of ACP-I of Kuo and Ohlrogge supra as described previously. In addition to mature 
protein, the pCGNISOL also encodes a 56 residue transit peptide sequence, as might be expected for a 
nuclear-encoded chloroplast protein. 

10 

Napin - ACP Construct 

pCGN796 was constructed by ligating pCGNISOL digested with Hindlll/ Bam HI, pUC8 digested with Hindlll 
and Bam HI and pUCl18 digested with Bam HI. The ACP gene from pCGN796 was transferred into a 
chloramphenicol background by digestion with Bam HI and ligation with Bam HI digested pCGN565. The 
resulting pCGN1902 was digested with EcoRI and Smal and ligated to Eco RI/ Sma l digested pUC1 18 to give 15 
pCGN1920. The ACP gene in pCGN1920 was digested at the Nco l site, filled in by treatment with the Klenow 
fragment, digested with Sma l and religated to form pCGN1919. This eliminated the 5'-coding sequences from 
the ACP gene and regenerated the ATG. This ACP gene was flanked with Pstl sites by digesting pCGN1919 
with Eco RI, filling in the site with the Klenow fragment and ligating a Pstl linker. This clone is called pCGN945. 

The ACP gene of pCGN945 was moved as a Bam HI/Pstl fragment to pUC1 18 digested with Bam HI and Pstl 20 
to create pCGN945a so that a Sma l site (provided by the pUC118) would be at the 5'-end of the ACP 
sequences to facilitate cloning into the napin cassette pCGN944. pCGN945a digested with Smal and Pstl was 
ligated to pCGN944 digested with Sma l and Pstl to produce the napin ACP cassette pCGN946. The na"pin ACP 
cassette was then transferred into the binary vector pCGN783 by cloning from the Hindlll site to produce 
PCGNS48. 25 

Construction of the Binan/ Vector pCGN783 

pCGN783 is a binary plasmid containing the left and right T-DNA borders of A. tumefaciens (Barker et al., 
Plant Mol. Biol. (1983) 2:335-350); the gentamicin resistance gene of pPHUI (Hirsch et al., Plasmid (1984), 
12:139-141) the 35S promoter of cauliflower mosaic virus (CaMV) (Gardner et al., Nucleic Acids Res. (1981) 30 
9:2871-2890), the kanamycin resistance gene of Tn5 (Jorgenson et al., infra and Woiff et al., ibid (1985) 
13:355-367) and the 3' region from transcript 7 of pTiA6 (Barker et al.. supra (1983)). 

To obtain the gentamicin resistance marker, the gentamicin resistance gene was isolated from a 3.1 kb 
Eco Ri-Pstl fragment of pPHIJI and cloned into pUC9 yielding pCGN549. The Hindlll- Bam HI fragment 
containing the gentamicin resistance gene was substituted for the Hindlll-Bglll fragment of pCGN587 creating 35 
PCGN594. 

pCGN537 was prepared as follows: The Hindlll-Smal fragment of Tn5 containing the entire structural gene • 
for APH M (Jorgenson eta}., Mol. Gen. Genet. (1979) 177:65) was cloned into pUC8 (Vieira and Messing, Gene 
(1982) 19:259), converting the fragment into a Hindlll- Eco RI fragment, since there is an EcoRI site immediately 
adjacent to the Sma l site. The Pstl- Eco RI fragment containing the 3'-portion of the APHII gene was then 40 
combined with an EcoRl- Bam HI-Sall-Pstl linker into the EcoRI site of pUC7 (pCGN546W). Since this construct 
does not confer kanamycin resistance, kanamycin resistance was obtained by inserting the Bglll-Pstl fragment 
of the APH II gene into the BamHI-Pstl site (pCGN546X). This procedure reassembles the APHII gene, so that 
Eco sites flank the gene. An ATG codon was upstream from and out of reading frame with the ATG initiation 
codon of APH II. The undesired ATG was avoided by inserting a Sau 3A-Pstl fragment from the 5'-end of APHII, . 45 
which fragment lacks the superfluous ATG, into the Bam HI-Pstl site of pCGN546W to provide plasmid 
pCGN550. 

The Eco RI fragment containing the APH II gene was then cloned into the unique Eco RI site of pCGN451, 
which contains an octopine synthase cassette for expression, to provide pCGN552 (1ATG). 

pCGN451 includes an octopine cassette which contains about 1556 bp of the 5' non-coding region fused via 50 
an Eco RI linker to the 3' non-coding region of the octopine synthase gene of pTiA6. The pTi coordinates are 
11,207 to 12,823 for the 3' region and 13,643 to 15,208 for the 5' region as defined by Barker et al.. Plant Mol. 
Biol. (1983) 2:325. 

The 5' fragment was obtained as follows. A small subcloned fragment containing the 5' end of the coding 
region, as a Bam HI-EcoRI fragment was cloned in pBR322 as plasmid pCGN407. The BamHI-EcoR! fragment 55 
has an Xmn l site in the coding region, while pBR322 has two Xmn l sites. pCGN407 was digested with Xmn l, 
resected with Bal3l nuclease and Eco RI linkers added to the fragments. After EcoRI and Bam HI digestion, the 
fragments were size fractionated, the fractions cloned and sequenced. In one case, the entire coding region 
and 10 bp of the 5' non-translated sequences had been removed leaving the 5' non-translated region, the 
mRNA cap site and 16 bp of the 5' non-translated region (to a Bam HI site) intact. This small fragment was 60 
obtained by size fractionation on a 7°/o acrylamide gel and fragments approximately 130 bp long eluted. 

This size fractionated DNA was ligated into M13mp9 and several clones sequenced and the sequence 
compared to the known sequence of the octopine synthase gene. The M13 construct was designated pT4, 
which plasmid was digested with Bam HI and Eco RI to provide the small fragment which was ligated to a Xho l 
to Bam HI fragment containing upstream 5' sequences from pTiA6 (Garfinkel and Nester, J. Bacterioi . (1980) 65 
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144:732) and to an EcoRI to Xhol fragment containing the 3' sequences. 

The resulting Xhol fragment was cloned into the Xho l site of a pUC8 derivative, designated pCGN426. This 
plasmid differs from pUC8 by having the sole Eco RI site filled in with DNA polymerase I, and having lost the Pstl 
and Hindlll site by nuclease contamination of Hindi restriction endonuclease, when a Xho l linker was inserted 

5 into the unique Hindi site of pUC8. The resulting plasmid pCGN451 has a single EcoRI site for the insertion of 
protein coding sequences between the 5' non-coding region (which contains 1,550 bp of 5' non-transcribed 
sequence including the right border of the T-DNA, the mRNA cap site and 16 bp of 5' non-translated 
sequence) and the 3' region (which contains 267 bp of the coding region, the stop codon, 196 bp of 3' 
non-translated DNA, the polyA site and 1,153 bp of 3' non-transcribed sequence). pCGN451 also provides the 

10 right T-DNA border. 

The resulting plasmid pCGN451 having the ocs 5' and the ocs 3' in the proper orientation was digested with 
EcoRI and the EcoRI fragment from pCGN551 containing the intact kanamycin resistance gene inserted into 
the Eco RI site to provide pCGN552 having the kanamycin resistance gene in the proper orientation. 
This ocs/KAN gene was used to provide a selectable marker for the trans type binary vector pCGN587. 

15 The 5' portion of the engineered octopine synthase promoter cassette consists of pTiA6 DNA from the Xho l 
at bp 15208-13644 (Barker's numbering), which also contains the T-DNA boundary sequence (border) 
implicated in T-DNA transfer. In the plasmid pCGN587 t the ocs/KAN gene from pCGN552 provides a 
selectable marker as well as the right border. The left boundary region was first cloned in M13mp9 as a 
Hindlll-Smal piece (pCGN502) (base pairs 602-2213) and recloned as a Kpnl-EcoRI fragment in pCGN565 to" 

20 provide pCGN580. pCGN565 is a cloning vector based on pUC8-Cm, but con taining pUC18 linkers. pCGN580 
was linearized with BamHI and used to replace the smaller Bglll fragment of pVCK102 (Knauf and Nester, 
Plasmid (1982) 8:45), creating pCGN585. By replacing the smaller Sail fragment of pCGN585 with the Xho l 
fragment from pCGN552 containing the ocs/KAN gene, pCGN587 was obtained. 
The pCGN594 Hindlll- Bam HI region, which contains an 5'-ocs-kanamycin-ocs-3' (ocs is octopine synthase 

25 with 5' designating the promoter region and 3' the terminator region, see U.S. application serial no. 775,923, 
filed September 13, 1985) fragment was replaced with the Hindlll- Bam HI polylinker region from pUC18. 

PCGN566 contains the EcoRI-Hindlll linker of pUC18 inserted into the EcoRl-Hindlll sites of pUC13-Cm. The 
Hindlll-Bglll fragment of pNW31C-8,29-1 (Thomashow et al., Cell (1980) 19:729) containing ORF1 and -2 of 
pTiA6 was subcloned into the Hindlll- Bam HI sites of pCGN566 producing pCGN703. 

30 The Sau3A fragment of pCGN703 containing the 3' region of transcript 7 (corresponding to bases 2396-2920 
of pTiA6 (Barker et al., (1983) supra ) was subcloned into the Bam HI site of pUC18 producing pCGN709. The 
EcoRI-Smal polylinker region of pCGN709 was substituted with the EcoRI- Sma l fragment of pCGN587, which 
contains the kanamycin resistance gene ( APH 3-M) producing pCGN726. 
The EcoRI-Sall fragment of pCGN726 plus the Bglll-EcoRI fragment of pCGN734 were inserted into the 

35 Bam HI-Sall site of pUC8-Cm producing pCGN738. pCGN726c is derived from pCGN738 by deleting the 900 bp 
EcoRI-EcoRI fragment. 

To construct pCGN167, the AM fragment of CaMV (bp 7144-7735) (Gardner et al., Nucl. Acid Res. (1981) 
9:2871-2888) was obtained by digestion with Alui and cloned into the Hindi site of Ml3mp7 (Messing et a!., 
Nucl. Acids Res. (1981) 9:309-321) to create C614. An EcoRI digest of C614 produced the EcoRI fragment 
40 from C614 containing the 35S promoter which was cloned into the Eco RI site-of pUC8 (Vieira and Messing, 
Gene (1982) 19:259) to produce pCGN146. 

To trim the promoter region, the Bglll site (bp 7670) was treated with Bglll and resected with Bal31 and 
subsequently a Bglll linker was attached to the Bal31 treated DNA to produce pCGN147, 

pCGN148a containing a promoter region, selectable marker (KAN with 2 ATG's) and 3' region, was prepared 
45 by digesting pCGN528 with Bglll and inserting the BamHI-Bglll promoter fragment from pCGN147. This 
fragment was cloned into the Bglll site of pCGN528 so that the Bglll site was proximal to the kanamycin gene of 
pCGN528. 

The shuttle vector used for this construct, pCGN528, was made as follows. pCGN525 was made by 
digesting a plasmid containing Tn5 which harbors a kanamycin gene (Jorgenson et al., Mol. Gen. Genet. 1979) 

50 HZ :65 > witn Hindlll- Bam HI and inserting the Hindlll- Bam HI fragment containing the kanamycin gene into the 
Hindlll- Bam HI sites in the tetracycline gene of pACYC184 (Chang and Cohen, J. Bacterioi. (1978) 
134:1141-1156). pCGN526 was made by inserting the Bam HI fragment 19 of pTiA6 (Thomashow et al., Cell 
(1980) 19:729-739), modified with Xhol linkers inserted into the Sma l site, into the Bam HI site of pCGN525. 
pCGN528 was obtained by deleting the small Xho l fragment from pCGN526 by digesting with Xho l and 

55 ' reiigating. 

pCGN149a was made by cloning the Bam HI-kanamycin gene fragment from pMB9KanXXI,into the Bam HI 
site of pCGN148a. 

pMB9KanXXI is a pUC4K variant (Vieira and Messing, Gene (1982) 19:259-268) which has the Xho l site 
missing but contains a functional kanamycin gene from Tn903 to allow for efficient selection in Agrobacterium . 

60 pCGN149a was digested with Bglll and Sphl. This small Bglll- Sph l fragment of pCGN149a was replaced with 
the Bam HI- Sph l fragment from Ml (see below) isolated by digestion with Bam HI and Sph l. This produces 
pCGN167, a construct containing a full length CaMV promoter, 1 ATG-kanamycin gene, 3' end and the bacterial 
Tn903-type kanamycin gene. Ml is an EcoRI fragment from pCGN546X (see construction of pCGN587) and 
was cloned into the Eco RI cloning site of M13mp9 in such a way that the Pstl site in the 1 ATG-kanamycin gene 

65 was proximal to the polylinker region of M13mp9. 
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The HindHI- Bam HI fragment in the pCGN167 containing the CaMV-35S promoter, 1ATG-kanamycin gene 
and the BamHI-fragment 19 of pTiA6 was cloned into the BamHI-Hindlll sites of pUC19 creating pCGN976. The 
35S promoter and 3' region from transcript 7 was developed by inserting a 0.7 kb Hindlll-EcoRI fragment of 
pCGN976 (35S promoter) and the 0.5 kb EcoRI-Saul fragment of pCGN709 (transcript 7:3') into the Hindlll-Sall 
sites of pCGN566 creating pCGN766c. 5 

The 0.7 kb Hindlll-EcoRI fragment of pCGN766c (CaMV-35S promoter) was ligated to the 1.5 kb EcoRI-SaH 
fragment in pCGN726c (1ATG-KAN 3' region) followed by insertion into the Hindlll-Sall sites of pUC119~to 
produce pCGN778. The 2.2 kb region of pCGN778, Hindlll-Sall fragment containing the CaMV-35S promoter 
and 1 ATG-KAN-3' region was used to replace the Hindlll-Sall linker region of pCGN739 to produce pCGN783. 

10 

Transfer of the Binary Vector pCGN948 into Agrobacterium 

pCGN948 was introduced into Agrobacterium tumefaciens EHA101 (Hood et al., J. Bacteriol . (1986) 
168:1291-1301) by transformation. An overnight 2 ml culture of EHA101 was grown in MG/L broth at 30°C. 0.5 
mi was inoculated into 100 ml of MG/L broth (Garfinkfel and Nester, J. Bacteriol . (1980) 144:732-743) and grown 
in a shaking incubator for 5 h at 30° C. The cells were pelleted by centrifugation at 7K, resuspended in 1 ml of 15 
MG/L broth and placed on ice. Approximately, 1 ug of pCGN948 DNA was placed in 100 ul of MG/L broth to 
which 200 of the EHA101 suspension was added; the tube containing the DNA-cell mix was immediately 
placed into a dry ice/ethanol bath for 5 minutes. The tube was quick thawed by 5 minutes in 37° C water bath 
followed by 2 h of shaking at 30° C after adding 1 ml of fresh MG/L medium. The cells were pelleted and spread 
onto MG/L plates (1.5% agar) containing 100 mg/l gentamicin. Plasmid DNA was isolated from individual 20 
gentamicin-resistant colonies, transformed back into E. coli , and characterized by restriction enzyme analysis 
to verify that the gentamicin-resistant EHA101 contained intact copies of pCGN948. Single colonies are picked 
and purified by two more streakings on MG/L plates containing 100 mg/l gentamicin. 

Transformation and Regeneration of B. Napus 25 

Seeds of Brassica napus cv Westar were soaked in 95% ethanol for 4 minutes. They were sterilized in 1o/o 
solution of sodium hypochlorite with 50 jil of Tween 20" surfactant per 100 ml sterilent solution. After soaking 
for 45 minutes, seeds were rinsed 4 times with sterile distilled water. They were planted in sterile plastic 
boxes 7 cm wide, 7 cm long, and 10 cm high (Magenta) containing 50 ml of 1/10th concentration of MS 
(Murashige minimal organics medium, Gibco) with added pyridoxine (500 ug/l), nicotinic acid (50 ug/l), glycine 30 
(200 ug/l) and solidified with 0.6% agar. The seeds germinated and were grown at 22° C in a 16h-8h light-dark 
cycle with light intensity approximately 65 uEm- 2 s- 1 . After 5 days the seedlings were taken under sterile 
conditions and the hypocotyls excised and cut into pieces of about 4 mm in length. The hypocotyl segments 
were placed on a feeder plate or without the feeder layer on top of a fitter paper on the soldified B5 0/1/1 or 
B5 0/1/0 medium. B5 0/1/0 medium contains B5 salts and vitamins (Gamborg, Miller and Ojima, Experimental 35 
Cell Res. (1968) 50:151-158), 3% sucrose, 2,4-dichlorophenoxyacetic acid (1.0 mg/l), pH adjusted to 5.8, and 
the medium is solidifed with O.6O/0 Phytagar; B5 0/1/1 is the same with the addition of 1.0 mg/l kinetin. Feeder 
plates were prepared 24 hours in advance by pipetting 1.0 ml of a stationary phase tobacco suspension culture 
(maintained as described in Fiilatti et al., Molecular General Genetics (1987) 206:192-199) onto B5 0/1/0 or 
B5 0/1/1 medium. Hypocotyl segments were cut and placed on feeder plates 24 hours prior to Agrobacterium 40 
treatment. 

Agrobacterium tumefaciens (strain EHA101 x 948) was prepared by incubating a single colony of 
Agrobacterium in MG/L broth at 30°C. Bacteria were harvested 16 hours later and dilutions'of 10 8 bacteria per 
mi were prepared in MG/L broth. Hypocotyl segments were inoculated with bacteria by placing the segments • 
in an Agrobacterium suspension and allowing them to sit for 30-60 minutes, then removing and transferring to 45 
Petri plates containing 35 0/1/1 or 0/1/0 medium (0/1/1 intends 1 mg/1 2,4-D and 1 mg/1 kinetin and 0/1/0 
intends no kinetin). The plates were incubated in low light at 22° C. The co-incubation of bacteria with the 
hypocotyl segments took place for 24-48 hours. The hypocotyl segments were removed and placed on 
B5 0/1/1 or 0/1/0 containing 500 mg/l carbenicillin (kanamycin sulfate at 10, 25, or 50 mg/l was sometimes 
added at this time) for 7 days in continuous light (approximately 65 uEm- 2 S- 1 ) at 22°C. The segments were 50 
transferred to B5 salts medium containing to/o sucrose, 3 mg/l benzylamino purine (BAP) and 1 mg/l zeatin. 
This was supplemented with 500 mg/l carbenicillin, 10, 25, or 50 mg/l kanamycin sulfate, and solidified with 
0.6% Phytagar (Gibco). Thereafter, explants were transferred to fresh medium every two weeks. 

After one month green shoots developed from green calli which were selected on media containing 
kanamyicin. Shoots continued to develop for three months. The shoots were cut from the calli when they were 55 
at least 1 cm high and placed on B5 medium with 1% sucrose, no added growth substances, 300 mg/l 
carbencillin , and solidified with 0.6% phytagar. The shoots continued to grow and several leaves were 
removed to test for neomycin phosphotransferase II (NPTII) activity. Shoots which were positive for NPTII 
activity were placed in Magenta boxes containing B5 0/1/1 medium with 10/0 sucrose, 2 mg/l indolebutyric 
acid, 200 mg/l carbenicillin, and solidified with 0.6% Phytagar. After a few weeks the shoots developed roots 60 
and were transferred to soil. The plants were grown in a growth chamber at 22° C in a 16-8 hours light-dark 
cycle with light intensity 220 uEm- 2 s- 1 and after several weeks were transferred to the greenhouse. 
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Southern Data 

Regenerated B. napus plants from cocultivations of Agrobacterium tumefaciens EHA101 containing 
PCGN948 and B. napus hypocotyls were examined for proper integration and embryo-specific expression of 
the spinach leaf ACP gene. Southern analysis was performed using DNA isolated from leaves of regenerated 
5 plants by the method of Dellaporta et al M ( Plant Mol. Biol. Rep. ( 1 983) 1:19-21) and purified once by banding in 
CsCI. DNA (10 ug) was digested with the restriction enzyme EcoRI, electrophoresed on a 0.7<Vo agarose gel 
and blotted to nitrocellulose (see Maniatis et al., (1982) supra .). Blots were probed with pCGN945 DNA 
containing 1 .8 kb of the spinach ACP sequence or with the EcoRI/ Hindi I! fragment isolated from pCGN936c 
(made by transferring the Hindlli/EcoRI fragment of pCGN930 into pCGN566) containing the napin 5' 

10 sequences labeled with 32 P-dCTP by nick translation (described by the manufac turer, BRL Nick Translation 
Reagent Kit, Bethesda Research Laboratories, Gaithersburg, MD). Blots were prehybridized and hybridized in 
500/o formamide; 10x Denhardfs 5xSSC, 0.1 o/ 0 SDS, 5 mM EDTA, 100 ug/ml calf thymus DNA and 100/o dextran 
sulfate (hybridization only) at 42°C. (Reagents described in Maniatis et al M (1982) supra.) Washes were in 
1xSSC, 0.10/0 SDS, 30 min and twice in O.lxSSC, 0.1 0/0 SDS at 55° C~ ~ 

15 Autoradiograms showed two bands of approximately 3.3 and 3.2 kb hybridized in the EcoRI digests of DNA 
from four plants when probed with the ACP gene (pCGN945) indicating proper integration of the spinach leaf 
ACP construct in the plant genome since 3.3 and 3.2 kb EcoRI fragments are present in the T-DNA region of 
PCGN948. The gene construct was present in single or multiple loci in the different plants as judged by the* 
number of plant DNA-construct DNA border fragments detected when probed with the napin 5' sequences 

20 

Northern Data 

Expression of the integrated spinach leaf ACP gene from the napin promoter was detected by Northern 
analysis in seeds but not leaves of one of the transformed plants shown to contain the construct DNA. 
Developing seeds were collected from the transformed plant 21 days post-anthesis. Embryos were dissected 

25 from the seeds and frozen in liquid nitrogen. Total RNA was isolated from the seed embryos and from leaves of 
the transformed plant by the method of Crouch et al., Virology (1985) 140:281-288) and blotted to 
nitrocellulose (Thomas, Proc. Natl. Acad. Sci. USA (1980) 77:5201-5205). Blots were prehybridized, hybridized, 
and washed as described above. The probe was an isolated Pstl/BamHI fragment from pCGN945 containing 
only spinach leaf ACP sequences labeled by nick translation. 

30 An RNA band of -0.8 kb was detected in embryos but not leaves of the transformed plant indicating 
seed-specific expression- of the spinach leaf ACP gene. 

Example II 

35 Construction of B. Campestris Napin Promoter Cassette 

A Bglll partial genomic library of B. campestris DNA was made in the lambda vector Charon 35 using 
established protocols (Maniatis et at., 1982, supra ). The titer of the amplified library was - 1.2 x 10 9 phage/ml. 
Four hundred thousand recombinant bacteriophage were plated at a density of 10 s per 9 x 9 in. NZY plate 
(NZYM as described in Maniatis et al., 1982, supra ) in NZY + 10 mM MgS0 4 + O.90/0 agarose after adsorption - 
40 to DH1 E. coli cells (Hanahan, Mol. Biol . (1983) 166:557) for 20 min at 37°C. Plates were incubated at 37°C for 
-13 hours, cooled at 4°C for 2.5 hours and the phage were lifted onto Gene Screen Plus (New England 
Nuclear) by laying precut filters over the plates for approximately 1 min and peeling them off. The adsorbed 
phage DNA was immobilized by floating the filter on 1.5 M NaCI, 0.5 M NaOH for 1 min., neutralizing in 1 5 M 
NaCI, 0.5 M Tris-HCI. pH 8.0 for 2 min and 2XSSC for 3 min. Filters were air dried until just damp, prehybridized 
and hybridized at 42° C as described for Southern analysis. Filters were probed for napin-containing clones 
using an Xhol/Sall fragment of the cDNA clone BE5 which was isolated from the B. campestris seed cDNA 
library described using the probe pN1 (Crouch et al., 1983, supra ). Three plaques were hybridized strongly on 
duplicate filters and were plaque purified as described (Maniatis et al., 1982, supra ). 

One of the clones named lambda CGN1-2 was restriction mapped and the napirTgene was localized to 
overlapping 2.7 kb Xhol and 2.1 kb Sail restriction fragments. The two fragments were subcloned from lam bda 
CGN1-2 DNA into pCGN789 (a pUC based vector the same as pUC1 19 with the normal polylinker replaced by 
the synthetic linker -5' GGAATTCGTCGACAGATCTCTGCAGCTCGAGGGATCCAAGCTT 3' (which repre- 
sents the polylinker EcoRI, Sail, Bglll, Pstl, Xhol, BamHI, Hindlll). The identity of the subclones as napin was 
confirmed by sequencing. The entire coding region sequence as well as extensive 5' upstream and 3' 
downstream sequences were determined (Figure 2). The lambda CGN1-2 napin gene is that encoding the 
mRNA corresponding to the BE5 cDNA as determined by the exact match of their nucleotide sequences. 

An expression cassette was constructed from the 5'-end and the 3'-end of the lambda CGN1-2 napin gene 
as follows in an analogous manner to the construction of pCGN944. The majority of the napin coding region of 
PCGN940 was deleted by digestion with Sail and religation to form pCGN1800. Single-stranded DNA from 
PCGN1800 was used in an in vitro mutagenesis reaction (Adelman et al., DNA (1983) 2*183-193) using the 
synthetic oligonucleotide 5' GCTTGTTCGCCATGGATATCTTCTGTATGTTC 3TTFTis oligonucleotide inserted an 
EcoRV and an Nco1 restriction site at the junction of the promoter region and the ATG start codon of the napin 
gene. An appropriate mutant was identified by hybridization to the oligonucleotide used for the mutagenesis 
and sequence analysis and named pCGN1801. 

A 1.7 kb promoter fragment was subcloned from pCGN1801 by partial digestion with EcoRV and ligation to 
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pCGN786 (a pCGN566 chloramphenicol based vector with the synthetic linker described above in place of the 
normal polylinker) cut with EcoRI and blunted by fill# # resulting expression cassette, pCGN1803 contains 
1.725 kb of napin promoter sequence, and 1.265 kb of napin 3' sequences with the unique cloning sites Sail, 
Bgll, Pstl, and Xhol in between. Any sequence that requires seed-specific transcription or expression in 
Brassica, i.e., a fatty acid gene could be inserted in this cassette in a manner analogous to that described for 5 
spinach leaf ACP and the B. napus -napin cassette in Example I. 

Example HI 

Other seed-specific promoters may be isolated from genes encoding proteins involved in seed 
triacylglycerol synthesis, such as acyl carrier protein from Brassica seeds. Immature seed were collected from 10 
Brassica campestris cv. "R-500," a self-compatible variety of turnip rape. Whole seeds were collected at 
stages corresponding approximately to 14 to 28 days after flowering. RNA isolation and preparation of a cDNA 
bank was as described above for the isolation of a spinach ACP cDNA clone except that the vector used was 
PCGN565. To probe the cDNA bank, the oligonucleotide (5')-ACTTTCTCAACTGTCTCTGGTTTAGCAGC-(3'J 
was synthesized using an Applied Biosystems DNA Synthesizer, model 380A, according to manufacturer's 15 
recommendations. This synthetic DNA molecule will hybridize at low stringencies to DNA or RNA sequences 
coding for the amino acid sequence (ala-ala-lys-pro-glu-thr-val-glu-lys-val). This amino acid sequence has 
been reported for ACP isolated from seeds of Brassica napus (Slabas et al., 7th International Symposium of * 
the Structure and Function of Plant Lipids, University of California, Davis, CA, 1986); ACP from B. campestris 
seed is highly homologous. Approximately 2200 different cDNA clones were analyzed using a colony 20 
hybridization technique (Taub and Thompson, Anal. Biochem . (1982) 126:222-230) and hybridization 
conditions corresponding to Wood et al., ( Proc. Natl. Acad. Sci . (1985) 82:1585-1588). DNA sequence analysis 
I of two cDNA clones showing obvious hybridization to the oligonucleotide probe indicated that one, 

designated pCGNIBcs, indeed coded for an ACP-precursor protein by the considerable homology of the 
encoded amino acid sequence with ACP proteins described from Brassica napus (Slabas et al., 1980 supra ). 25 
Similarly to Example II, the ACP cDNA clone can be used to isolate a genomic clone from which an expression 
cassette can be fashioned in a manner directly analogous to the B. campestris napin cassette. 

Other Examples 

Ninety-six clones from the 14-28 day post-anthesis B. campestris seed cDNA library (described in the 30 
previous example) were screened by dot blot hybridization of miniprep DNA on Gene Screen Plus nylon filters. 
Probes used were radioactively labeled first-strand synthesis cDNAs made from the day 14-28 post-anthesis 
mRNA or from B. campestris leaf mRNA. Clones which hybridized strongly to seed cDNA and little or not at all 
to leaf cDNA were catalogued. A number of clones were identified as representing the seed storage protein 
napin by cross-hybridization with an Xhol/Sall fragment of pNl (Crouch et al., 1983, supra ), a B. campestris 35 
genomic clone as a source of an embryo-specific promoter. 

Other seed-specific genes may also serve as useful sources of promoters. cDNa clones of cruciferin, the 
other major seed storage protein of B. napus , have been identified (Simon et al., 1985, supra ) and could be 
used' to screen a genomic library for promoters. 

Without knowing their specific functions, yet other cDNA clones can be classified as to their level of 40 
\ expression in seed tissues, their timing of expression (i.e., when post-anthesis they are expressed) and their 

} approximate representation (copy number) in the B. campestris genome. Clones fitting the criteria necessary 

for expressing genes related to fatty acid synthesis or other seed functions can be used to screen a genomic 
library for genomic clones which contain the 5' and 3' regulatory regions necessary for expression. The 
non-coding regulatory regions can be manipulated to make a tissue-specific expression cassette in the 45 
general manner described for the napin genes in previous examples. 

One example of a cDNA clone is EA9. It is highly expressed in seeds and not leaves from B. campestris . It 
represents a highly abundant mRNA as shown by cross-hybridization of seven other cDNAs.from the library by 
dot blot hybridization. Northern blot analysis of mRNA isolated from day 14 seed, and day 21 and 28 
post-anthesis embryos using a 700 bp EcoRI fragment of EA9 as a probe shows that EA9 is highly expressed 50 
at day 14 and expressed at a much lower level at day 21 and day 28. The restriction map of EA9 was determined 
and the clone sequenced. Identification of a polyadenytation signal and of poiyA tails at the 3'-end of EA9 
confirms the orientation of the cDNA clone and the direction of transcription of the mRNA. The partial 
sequence provided here for clone EA9 (Figure 3) can be used to synthesize a probe which will identify a unique 
class of Brassica seed-specific promoters. 55 

It is evident from the above results, that transcription or expression can be obtained specifically in seeds, so 
as to permit the modulation of phenotype or change in properties of a product of seed, particularly of the 
embryo. It is found that one can use transcriptional initiation regions associated with the transcription of 
sequences in seeds in conjunction with sequences other than the normal sequence to produce endogenous 
or exogenous proteins or modulate the transcription or expression of nucleic acid sequences. In this manner, 60 
seeds can be used to produce novel products, to provide for improved protein compositions, to modify the 
distribution of fatty acid, and the like. 

All publicatons and patent applications mentioned in this- specification are indicative of the level of skill of 
those skilled in the art to which this invention pertains. All publications and patent applications are herein 
incorporated by reference to the same extent as if each individual publication or patent application was 65 
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Claims 

1. A seed comprising an expression cassette, said cassette comprising a seed specific transcriptional 
initiation region, a sequence of interest under the transcriptional regulation of said initiation region, and a 
transcriptional termination region, said expression cassette inserted into the genome of said seed at 
other than the natural site for said transcriptional initiation region. 

2. A seed according to Claim 1 , wherein said sequence of interest is an open reading frame encoding an 
endogenous protein or mutant thereof. 

3. A seed according to Claim 1 , wherein said sequence of interest is an open reading frame encoding an 
exogenous protein. 

4. A seed according to Claim 1 , wherein said sequence of interest encodes a sequence complementary 
to a transcription product of said seed. 

5. A seed according to Claim 1 1 wherein said seed is of the Brassica family. 

6. An expression construct comprising a seed specific transcriptional initiation region, a polylinker of 
less than about 100 bp having at least two restriction sites for insertion of a DNA sequence to be under" 
the transcriptional control of said initiation region, and a transcriptional termination region, the sequence 

* of said polylinker being other than the sequence of the gene naturally under the transcriptional control of 
said initiation region. 

7. An expression construct according to Claim 6, wherein said transcription initiation region is from a 
Brassica seed gene. 

8. An expression construct according to Claim 7, wherein said gene is a napin gene. 

9. An expression cassette comprising a seed specific transcriptional initiation region, a DNA sequence 
of interest, other than the natural sequence joined to said initiation region, to be under the transcriptional 
control of said initiation region, and a transcriptional termination region. 

10. An expression cassette according to Claim 9, wherein said transcriptional initiation region is a 
Brassica gene initiation region. 

1 1 . An expression cassette according to Claim 10, wherein said gene is a napin gene. 

12. An expression cassette according to Claim 10, wherein said sequence of interest is a structural gene. 

13. An expression cassette according to Claim 12, wherein said structural gene encodes a protein in the 
biosynthetic pathway for fatty acid production. 

14. An expression cassette according to Claim 13, wherein said protein is acyl carrier protein. 

15. A vector comprising an expression cassette according to any of Claims 9 to 14, a prokaryotic 
replication system, and a marker for selection of transformed prokaryotes comprising said marker. 

16. A method for modifying the genotype of a seed comprising: 

growing a plant to seed production, wherein cells of said plant comprise an expression cassette 

according to any of Claims 9 to 14, 

whereby seeds are produced of modified genotype. 



-12- 



^ime^ f? ^- "s\ • - ' 



Best Available Copy 



0255378 



NAP-27C Linear 



I2NGT3 - 702 



FokZ 



SfaNZ 
Maelll 



MboII 



1 AAAGGGATGTGTCTTGGTATGTATCTACGAGTAACAAAAGAGAAGATGCAATTGAGTAGTAGAASGATTTG 

15 31 55 

36 



Rgal 



Alul Ahalll rokl EcoRV 

1 I II, 

72 AGAGCTTTT45AAGCC34TCASGTGTGTGCTTT4ATCTTATTGATATCATCCAT74GCGTTGTTTAATGC3 
76 32 106 . H7 130 



'Ddel 

143 TCTTTAGATATGTTTCTGTTTCTTTCTCAGTGTCTGAATATCTGATAAGTGCAATGTGAGAAAGCCACACC 2 

169 



Sap! 
I 



TaqT 
Hinfl 



214 AAACCAAAATAT?CAAATC7TATATTT?TAATAATGTCGAATCACTCGGAGTTGCCACCTTCtGTGCCAAT 2 
224 253 

251 



HlafZ MboII ScoRl 

I I I 

235 TGTGCTGAATCTATCACACTAAAAAAAACATTTCTTCAAGGTAATGACTTGTGGACTATCGTTCTGAATTC 3 
292 310 351 



Maell 

356 TCATTAAGTTTTTATTTTTTGAAGTTTAAGITTTTACCTTCTTTTTTGAAAAATATCGTTCATAAGAIGTC 4 

424 



SphI 

ScrFZ NlaZZZ Nap (7524) I 

Scorii Alul SfaNI NlaZZZ HaelZZ 

II II II | 

427 ACGCC AGGACATGAGCTACACA7CACATATTAGCATGCAGATGCGGACGATTTGTCACTCACTTCAAACAC 
43C 442 457 464 430 

432 440 464 

464 
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Sspl PmaCI 
?*J ini1 Ndel Nap (7524) I Maell 



Alul 



Avalll Nlaiil Aflui upnl Nlalll 



498 CTAAAAGArcT7CTCT^^ 

508 531 539 550 

539 553 
543 551 

HphI Mnll 

I I 

569 |C^TTC?CACCTATAAATTAGAGGCTCGGCTTCA^ g 

Mbol I Alul Mnll 

6 4 0 TA CACAAAZ2GC6AACAAGCTCTTCCTCGTCTCGGCAACTCTCGCCTTSTTCTTCCTTCTCAC 70 2 
> 653 659 675 

SITS [NAP-270] : Q 



gena 



^iha^t^P?^ °!, t 5! P ron!0t « wgipn of the ABnNa napin 
#< irV tart ^ TG) of the °P en reading frame is underlined 
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-Lambda CGNl-2 
NCG-I86 Linear 



LENGTH - 4325 



Xhol 

hit? HindHI 

II Alul Taql . 

i =tcgaggcagtcactmcatgaagt?tgacgju;gagcccaactatG(Kaagcttatttctcttttccat 69 

2 3 52 
2 * 50 



Sad 

Hhai xbal Alul 

70 ACTCTAATTGAGCCG7GCGCTCTA7CTAGACCAATTAGAATTGATGGAGCTCTAAAGGTTGCTGGCTGT 138 

89 95 119 

121 

*** Ndel 
1 3 9 TTTCTTGTTCATATGATTAACTTCT AAACTTGTGTATAAATATTCTCTGAAAGTGCITCTTTTGGCATA 207 
150 206 . 

208 T5TAGGTTGGGCAAAAACGAGGAAGATTGCTTCTCAATTTGGAAGAGGATGAACAGCC5AAGAAGAAAA 276 



Sau3AI 
DcLel 

^277 TAAGAATAGGCAGTCCTGC7ACTCAATGGATCTCAGTCTATAACGGTCGTCGTCCCATGAAACAGAGGT 345 

309 

305 

ZcoRV 

346 AAAACATTTTTTGCATATACACTTTGAAAGTTCCTCACTAACTGTGTAATCTTTTGGTAGATATCACTA 4 1 4 

408 
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Hindi 
Hhal 

Haelli 

Ddel 
■Bstfill 

^all Haelll Alul 

415 CAATGTCGGAGAGACAA3GGCTGMNCANCATATACAAAAGGGAAATGAAGATGGCCTTTTGATTAGCTG 433 

«9 469 481 

438 

439 

439 
440 
438 

Alul Kinfl 

| 484 -3TAGCATCAGCAGC?AATCTCTGGGCTCTCATCATGGATGCTGGAACTGGATTCACTTCTGIAGTTTA 552 

498 . 535 

Mapl 

*F«« Hinfl 

I I 

553 7GAGTTGTCACCGGTCTTCCTACACAAGGTAATAATCAGTTGAAGCAATTAAGAATCAAT7TGATTTGT 621 

ill 604 

Ddel 

I 

622 AGTAAACTAAGAAGAACTTACCTTATGTTTTCCCCGCAGGACTGGATTATGGAACAATGGGAAAAGAAC 690 
629 

Sad 

Alul Alul Alul 
691 TACTATATAAGCTCCATAGCTGGTTCAGATAACGGGAGCTCTTT AGTTGTTATGTCAAAAGGT7AGTGT 759 

702 710 729 

731 

7 60 TTAGTGAATAATAAACTTATACCACAAAGTCTTCATTGACTTATTTATATACTTGTTGTGAATTGCTAG 828 



Ddel fiinfl 
829 5AACTACTTATTCTCAGCAGTCATACAAAGTGAGTGACTCATTTCCGTTCAAGTGGATAAATAAGAAAT 897 

342 865 
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* imI TaqI 

399 GGAAAGAAGATTTTCATGTAACCTCCATGACAACTGCTGGTAATC3XTGGGGTGTGG7AATCTCSAGGA 966 
908 



961 



Sau3AI 
Bell 

967 ACTCTGGCTTCTCTGATCAGGTAGGTTTTTGTCTCTTATTGTCTGGTGtTTTTATTTTCCCCTGATAGT 1035 

981 
981 

AluZ Raal 

^036 CTAATATGATAAACTCTGCGTTGTGAAAGGTGGTGGAGCTTGACT7TTTGTACCCAAGCGATGGGATAC 1104 

1074 1087 

S«u3AI Alul 

1105 ATAGGAGGTGGGAGAATGGGTATAGAATAACATCAATGGCAGCAACTGCGGATCAAGCAGCTTTCATAT 1173 

1155 1163 

Seal 

BiafZ Raal 
1174 TAAGCATACCAAAGCGTAAGATGGTGGATGAAACTCAAGAGACTCTCCGCACCACCGCCTTTCCAAGTA 1242 

1215 1242 

1242 

Alul Sau3Al Ddei 

1243 CTCATGTCAAGGTTGGTTTCTTTAGCTTTGAACACAGATTTGGATCTTTTTGTTTTGTTTCCATATACT 1311 

1268 1285 1311 

Odel 

Avail Alul Hinfl Raal 

III II 
1312 T AGGACCTGAGAGCTTTTGGT7GATTTTTTTTTCAGGACAAATGGGCGAAGAATCTGTACATTGCATCA 1380 

1315 1325 1363 1370 

1319 

1381 ATATGC7ATGGCAGGACAGTG7GCTGATACACACTTAAGCATCATGTGGAAAGCCAAAGAGIATTGGAG 1449 
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0 25 5 37 8^' v"" 



Hinfl 
Ddel 

II 

1450 CGAGACTCAGGGTCGTCATAATACCAATCAAAGACGTAAAACCAGACGCAACCTCTTTGGTTGAATGTA 1313 

1456 
1454 

Hsal 

1519 ATGAAAGGGATGTGTCTTGGTATGTATGTACGAATAACAAAAGAGAAGATGGAATTAGTAGTAGAAATA 1587 

1548 



Alul EcoRV 
' I 
3 a 8 TTTGGGAGCTTT7TAAGCCCTTCAAGTGTGCT7TTTA7C77A77GA7A7CATCCATTTGCG77G77TAA 1 65 6 



1596 1635 



Xbal Ddal 
I I 
1 S 3 7 T.GCGTCTCTAGATATGTTCCTATATCTTTCTCAGTGTCTGA7AAGTGAAATGTGAGAAAACCATACCAA 172 S 

1664 1687 

Hinfl 

1726 ACCAAAASATTCAAATCTTATTTTTAA7AATG7TGAATCACTCGGAGTTGCCACCTTCTGTGCCAATTG 17 9 4 

1761 

Hinfl EcoRI 
I | 
1795 TGC7GAATCTATCACACTAGAAAAAAACATTTCTTCAAGGTAATGACTTGTGGACTATGTTCTGAATTC 18 63 

1800 1859 
1864 7CAT7AAG77777A77T7C7GAAG7TTAAG7TTT7ACCTTCTGTTTTGAAATATATCGTTCATAAGA7G 1932 



SphI 

BatNI Alul Sau3AI 
1933 7CACGCCAGGACATGAGCTACACATCGCACATAGCATGCAGATCAGGACGA7T7G7CAC7CAC77CAAA .2001 

1940 1950 1973 

1971 
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SphI 

Ddal Alul Hhal Ndal Nail Sau3AI 

'I I I'll | 

2002 CACCTAAGAGCTTCTCTCTCACAGCGCAC ACACATATGCATGCAATATTTACACGTGATCGCCATGCAA 2070 

2006 2012 2028 2036 2042 2058 

2044 

20 7 1 ATCTCCATTCTCACCTATAAATTAGAGCCTCGGCTTCACTCTTTACTCAAACCAAAACTCATCACTACA 2139 



Alul 

2140 GAACATACACAAATGGCGAACAAGCTCTTCCTCGTCTCGGCAACTCTCGCCTTGTTCTTCCTTCTCACC 2208 

MMAlaAanLysLeuPheLeuValSerAlaThrLauAlaLeuPhaPheLeuLeuThr 
2164 

Taqi Nael 
Sail _ Map I 

Hindi Epall 
AccI AccI Haelll 

I III || I 

2*C 3 AATGCCTCCG7CTACAGGACGGTTGTGGAAGTCGACGAAGATGATGCCACAAATCCAGCCGGCCCATTT 2277 
AanAlaS«rValTyrArgThtValValGluValAapGluA«TjA8pAlaThrAsp.ProAiaGlyProPha 
2220 2241 2271 

223d 2268 
2240 2268 
2241 2269 

Hindi II 

Hinfl Alul 
I I | 

^78 AGGA7TCCAAAATGTAGGAAGGAGTTTCAGCAAGCACAACACCTGAAAGCTTGCCAACAATGGCTCCAC 2346 
ArgIl*PsoLyaCyaArgl,y8GluPheGlnGlnAlaGlnHiaLauLy8AlaCy3GlnGlnTrpLeuHi8 
2281 2327 

2325 

Mapl Avail 
Hpall Avail Alul Taqi 

1 I I I I 

2347 AAGC AGGCAATGCAGTCCGGTAGTGGTCCAAGCTGGACCC7CGATGGTGAGTTTGATTTTGAAGACGAC 2415 
LysGlnAlaWETGlnSerGlySarGlyProSerTrpThrLeuAapGlyGluPhaAapPheGluAspAsp 

?364 2372 2379 2388 
2364 2332 
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Ha«IXI Sad 
Apal fiaelll AIuI 

III II 
2 416 GTG3 AG AACC^CAAC^GGCJCCCGCAGCAGAGGCCACCGCTGCTCCAGCAaTGCTGCAACGAGC'rCCAC 2484 
ValGluAanGlnGlnGlnGIyProGlnGlnArgProProLauLauGlnGlnCysCy9AanGluLeuHi3 

2439 2449 2479 

2436 248L 

Taql 

BstNI Hinfl 
I I | 

2435 CAGGAAGAGCCACT?TGCGTTTGCCCAACCTTGAAAGGAGCATCCAAAGCGGT7AAACAACAGATTCGA 2553 
GlnGluGluSroLauCysValCysProThrLauLyaGlyAlaSerLysAlavaliysGlnGlnlleArg 
2486 2548 

2551 

2554 CAACAACAGGGACAACAAATGCAGGGACAGCAGATGCAGCAAGTGATT AGCCGTATCTACCAGACCGCT 2622 
GlnGlaGIaGlyGlRSInifiTGlnGlyGlnGinMETGlnGlnVallleSerArglleTyrGlr.ThiAla 

Alul BstNI 
I I 
2523 ACGCACTTACCTAGAGCTTGCAACATCAGGCAAGTTAGCATTTGCCCCTTCCAGAAGACCATGCC7GGG 2S91 
TlirHiaiiattPffeArgAlaCyaAsnlleAsgGlrtValSMllaCysProPheGlnLyaThrMSTProGly 
2639 2688 

MspX 

Epall Xhol 
SaaSII Taql 

a?s: air.fi Aval Accl 

I I I III 

2592 CCC5GCTTCTACTAGAT7CCAAACGAATATCCTCGAGAGTGTGTATACCACGGTGATATGAGTGTGGTT 27 SO 
?roGIy?h<3Tyr » 

2594 2707 2724 2736 

2692 2725 
2694 2724 
2634 

HincSS Raal 
I I 
2761 GTTGATGTMG:TAACACTACATAGTCATGGTGTGTGTTCCATAAATAATGTACTAATGTAATAAGAAC 2829 

2771 2313 

AceZ 

i 

2830 TACTCCG^AGACGGTAATAAAAGAGAAGTTTTTTTTTTTACTCTTGCTACTTTCCTATAAAGTGATGAT 2898 
2338 
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Seal 
RaaZ 

2899 TAACAACAGATACACCAAAAAGAAAACAATTaATCTATATTCACAATGAASCAGTACTAGTCTAITGAA 2967 

2954 . 
2954 

Sa.u3Al 

2963 CATGTCAGATTTTCTTTTTCTAAATGTCTAATTAAGCCTTCAAGGCTAGTGATGATAAAAGATCATCCA 30 3 6 

3028 

Sau3Al Sau3AI 
ButHZ . Hinfl Bell 

III 
3C37 ATGGSATCCAACAAAGACTCAAATCTGGTTTTGATCAGATACTTCAAAACTATTTTTGTAT7CATTAAA 3105 

^ 3041 3053 3069 

3041 3069- 

Hinfl 

3106 TTATGCAAGTGTTCTTTTATTTGGTGAAGACTCTTTAGAAGCAAAGAACGACAAGCAGTAATAAAAAAA 3174 

3135 

3175 ACAAAGTTCAGTTTTAAGATTTGTTATTGACT'JATTGTCATTTGAAAAAT&TAGTATGATATTAATATA 3243 
3244 G?TTTATTTATATAATGCTTGTCTATTCAAGATTT5AGAACA2TAATATGA?ACTG?CCACATATCCAA 3312 

Ndal 

3313 TATATT AAGTTTCATTTCTGTTCAAACATATGATAAGATGGTCAAATGATTATGAGTTrTGTTATTTAC 3381 

33(11 

TaqI 3au3AI 
Alul RsaX 

3392 CTGAAGAAAAGATAAGT5AGCT7CGAGTTTCTGAAGGGTACGTGASCTTCATTTCTTGGCTAAAAGCGA 34 50 

3402 3421 
3405 3425 



3451 ATATGACATCACCTAGAGAAAGCCGATAATAGTAAACTCTGTTC7TGGTTTTTGGTTTAATCAAACCGA 3519 
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Map I 

Mspl Ddel Hpall 
Hpall Alul Ndel Hinfl 

I I I I II 

3520 ACCGGTAGCTGAGTGTCAAGTCAGCAAACATCGCAAACCATATGTCAATTCSTTAGATTCCCGGTTT AA 3588 

3S22 3528 3560 3576 

3522 3529 358 1 

. 3581 

Map I 

Hps 1 1 

I 

3 58 9 GTTGT AAACCGGTATTTCATTTGGTGAAAACCCTAGAAGCCAGCCANCCTTTTTAATCTAATTTTTGCA 3657 

3598 
3598 

Hinfl 
Hindi 
Ddfcl BstNI 

3658 AACGAGAAGTCACCACACCTCTCCACTAAAACCCTGAACCTTACTGAGAGAAGCAGAGNCA^'AAAGAA 3726 

3702 3718 
3715 
3714 

3727 CAAATAAAACCCGAAGATGAGACCACCACGTGCGGCGGGACGTTCAGGGGACGGGGAGGAAGAGAATGR 3795 



Avail 

Aiul Avail 
I I I 
3796 CGGCGG5MNTTTGGTGGCGGCGGC5GACGTTTTGGTGGCGGCGGTGGACGTTTTGGTGGCGGCGGTGGA 3864 

3804 3863 
3801 

EcoRV Avail Ddel 

II | 
3865 CCTTTGGTGGTGGATATCGTGACGAAGGACCTCCCAGTGAAGTCATTGGTTCGTTTACTCTTTTCTTAG 3933 



3880 3892 



3930 



Ta ?„> T Hindlll 

, ? infI Alui Ddei 

II II I 

3934 TCGAATCTTATTCTTGCTCTOCTCGTTGTTTTACCGATAAAGCTTAAGACTTTATTGATAAAGTTCTCA 4002 

,Jf 37 3976 4000 

3935 3S74 

* luI *»* HiafI Ddei 

I II 

4CC3 GC7TTGAATGTGAATGAAeTG.TTTCCTG$TTA^^ 4C71 
4004 4059 4069 

Hinfl 

4072 GCACTTTTGTTAGATTCATCTTTGTGTTTAAGTTAAAAGGTAGAAAeTTTSTGMTTGTCTCCGTTATG 4140 

4085 

Hindi 

I 

4141 ACAAGGTTAACTTTGTTGGTTATAACAGAAGTTGCSACCTTTCTCl^TGCTTGTGAGGGTGATGCTGTG 4209 
4146 

Avail Alul Ddei 5«u3AI 
I I I | 

4210 GACCAAGCTCTCTCAGGCGAAGATCCCTTACTTCAATGCCCCAATCTACTTGGAAAACAAGACACAGAT 4273 
4210 4217 4222 4231 

TaqX 

Sail 
PatI 

Hindi II Hindi 
Sau3AI Alul Aecl EcoRI 

427 9 TGGGAAAGTTGATGAGATCCAAGCTTGGGCTGCAGGTCGACGAATTC 4323 

4294 4302 4316 4321 

4300 4314 
4313 
4315 
4316 
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SfaNI 

I 

3SS GAAATGGTAAGTTTATGTGATGCAAAAGATTTGAT^mm^MGGGGGGGGGGGG. 415 
365 



Partial sequence of B. camoe atria teed. cDNA. EA9 suf f icient 
to identify a genomic clone containing * promoter. The paly- 
adenylation sign* I AATAAA ia underlined .as ate the polyA tail*. 
The itop ccdon of the presumed open reading frame- ia double underlined 
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